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Dear Editor, 

Coronaviruses (CoVs) have a broad spectrum in 
humans and other animals, causing asymptomatic infec¬ 
tions or respiratory tract infections, gastroenteritis, and 

1 Q 

neurological diseases of varying severity _ . Co Vs are 
the largest known RNA viruses, belonging to the family 
Coronaviridae . On the basis of serology and genome 
phylogeny, CoVs are divided into four genera named 
Alpha-, Beta-, Delta-, and Gamma-coronavirus^’ 6 . To 
date, six human coronaviruses (HCoVs) have been iden¬ 
tified, including two a-CoVs (HCoV-229E and HCoV- 
NL63) and four (3-CoVs (HCoV-OC43, HCoV-HKUl, 
severe acute respiratory syndrome CoV (SARS-CoV) and 
Middle East respiratory syndrome CoV(MERS-CoV)) . 
Human coronavirus OC43 (HCoV-OC43) is more pre¬ 
dominant than other HCoVs, especially in children and 
elderly persons ’ . In addition to the high nucleotide 
substitution rates, the genotype shift by natural recom¬ 
bination of HCoV-OC43 is thought to be an adapting 
mechanism for maintaining its epidemic^ 9 . Since the first 
isolation of HCoV-OC43 in the 1960s, seven genotypes 
(A-G) have been identified by phylogenetic analysis 
of the main genes, such as spike (S), RNA-dependent 
RNA polymerase (RdRp) and nucleocapsid (N), as well 
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as the whole genome ~~ . However, the molecular epi¬ 
demiological data of HCoV-OC43, especially its genotype 
shift, are scarce in China. Here, we report a novel HCoV- 
OC43 genotype identified in mainland China. 

From November 2014 to November 2016, a prospective 
study was conducted in hospitalized children with com¬ 
munity acquired pneumonia (CAP) at 13 hospitals located 
in mainland China. A total of 2721 cases were enrolled 
into this study. The presence of HCoV-OC43 nucleic acid 
was screened by using an RVP Fast V2 kit (Luminex, 
USA) with a Luminex Magpix after RNA extraction from 
throat swabs or nasopharyngeal aspirates. Total RNA 
from specimens was converted into cDNA using oligo 
(dT) primers and the Superscript IV Reverse Transcrip¬ 
tion System (Invitrogen, Carlsbad, CA). The full-length S, 
RdRp, N, and viral genomes (from the 5'-end of the 
ORFla gene to the 3'-end of the poly-A tail) were 
amplified from HCoV-OC43-positive samples by a gen¬ 
ome walking method involving a total of 44 overlapping 
fragments using a set of specific primers (Supplementary 
Table SI) 8,12 . The genome sequence was determined as 
previously described 8 . Sequences were aligned using the 
ClustalW program implemented in MEGA 5.03 (version 
5.0; Sudhir Kumar, Arizona State University, Tempe, AZ, 
USA). Maximum likelihood (ML) trees of whole-genome 
sequences and the full-length sequences of the S, RdRp, 
and N genes were constructed with the best-fit general 
time reversible model with gamma-distributed rate var¬ 
iation across sites and 1000 bootstrap replicates imple¬ 
mented in MEGA 5.03 14 . Neighbor-joining trees of 24 
known genes and whole genomes were constructed with 
Kimuras two-parameter model and 1000 bootstrap 
pseudoreplicates implemented in MEGA 5.03 4 . To ana¬ 
lyze potential recombination events, the complete 
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Fig. 1 Genetic analysis of the HCoV-OC43 strains based on the whole-genome sequences, a Phylogenetic tree of the HCoV-OC43 strains based 
on whole-genome sequences. The tree was constructed by using the maximum likelihood (ML) method with the best-fit general time reversible 
model with gamma-distributed rate variation across sites and 1000 bootstrap replicates implemented in MEGA 5.03. Bootstrap values over 70% are 
shown in the nodes, b Estimation of pairwise genetic distances between genotype H and genotypes A-G of HCoV-OC43 strains based on the whole- 
genome sequences 


genome sequences of HCoV-OC43 were aligned and 
analyzed using the similarity plot and boot scanning 
method in Simplot (version 3.5.1, http://sray.med.som. 
jhmi.edu/SCRoftware). All sequences generated in this 
study have been deposited in GenBank, and the accession 
numbers are MG 197709 to MG 197723 (Supplementary 
Table S2). The reference sequences were retrieved from 
GenBank on December 2017. 

HCoV-OC43 was detected in 1.5% (42/2721) of enrolled 
cases. A total of 15 whole genomes of HCoV-OC43 were 
obtained from 42 respiratory specimens of OC43-positive 
cases. To identify the genotype of OC43-positive samples, 
the ML phylogenetic trees based on the full-length 
sequences of the S, RdRp, and N genes were con¬ 
structed by using the representative strains of genotypes 
A-G (Supplementary Fig. SI). Phylogenetic analysis of the 
S gene clustered all reference strains in genotypes A-G, 
which agreed with previous reports ,10_13 . The 15 
OC43 strains identified in the present study were orga¬ 
nized into two clusters (Supplementary Fig. 1A). Eight 
OC43 strains, including HZ-459/16, BJ-165/15, BJ-124/ 
15, BJ-221/15, GZFE-26/15, YC-207/15, WZ-522/15, and 
WZ-303/15, clustered into genotype G (D-like) lineage, 
with a high nucleotide similarity (99.0-99.9%) to strains 
isolated in Malaysia. The other seven strains (BJ-112/15, 
BJ-164/15, YC-72/15, YC-55/15, YC-68/15, YC-67/15, 
and CC-23/15) fell into the genotype B lineage and shared 
nucleotide similarity (96.0-99.7%) with strains identified 


in Beijing. However, all the 15 strains from the present 
study had close relationships with the representative 
strains of genotype D or G (D-like) in the ML tree of the 
RdRp gene (Supplementary Fig. IB). Importantly, the 
bootstrap values at several nodes, such as genotype D or 
C, were lower than 70% in the phylogenetic tree of the 
RdRp gene, which led to an unresolved tree. The possible 
reasons for this result maybe due to the highly conserved 
nucleotides compared to the other genes and less genetic 
information in GenBank. Analysis of N genes showed that 
eight strains belonged to genotype G strains in the tree 
of the S genes clustered together, while the other strains 
belonged to genotype B strains in the tree of S genes 
clustered together (Supplementary Fig. 1C). The incon¬ 
gruence of several phylogenetic analyses of different genes 
suggested the occurrence of recombination. 

To further explore the evolutionary characteristics of 
the 15 OC43 strains, a ML tree was generated using the 
whole-genome sequences and was compared to other 
whole genomes of OC43 strains deposited in GenBank. 
These reference strains were divided into genotypes from 
A to G as reported by Oong et al. . Eight OC43 strains 
clustered with genotype G strains circulating in Malaysia 
with high nucleotide similarity (99.2-99.6%). However, 
the other seven OC43 strains clustered into a dependent 
novel lineage (Fig. la). Based on the estimation of the 
intergenotype pairwise genetic distances, the distances of 
the novel lineage compared with genotypes B, C, D, F, and 
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Table 1 Demographic and clinical manifestation of HCoV-OC43 positive cases 


Genotype 

Strains ID 

Collection date 

Age (year) 

Gender 

Clinical manifestation 

G 

BJ-124/15 

2015/5/6 

0.08 

Male 

Cough, Sneezing, nasal discharge 


WZ-303/15 

2015/5/6 

1.58 

Male 

Fever, cough, expectoration 


WZ-522/15 

2015/5/16 

6.58 

Male 

Cough, nasal discharge 


GZYF-26/15 

2015/5/18 

3.83 

Male 

Fever, cough, nasal discharge, headache, sore throat 


BJ-165/15 

2015/6/9 

0.42 

Male 

Fever, cough, expectoration 


BJ-221/15 

2015/8/14 

1.17 

Male 

Fever, expectoration 


YC-207/16 

2016/5/5 

2.75 

Male 

Fever, expectoration 


ZJ-459/16 a 

2016/6/20 

9.25 

Female 

Fever, pectoralgia, pleural effussion 

H 

YC-55/15 

2015/3/12 

3.25 

Female 

Fever, cough, expectoration 


BJ-112/15 

2015/4/22 

0.17 

Female 

Fever, expectoration 


BJ-164/15 

2015/6/2 

0.5 

Female 

Cough, sore throat, hoarseness of voice 


YC-67/15 

2015/6/4 

0.83 

Female 

Fever, cough 


YC-68/15 

2015/6/5 

0.42 

Male 

Fever, expectoration 


YC-72/15 

2015/6/13 

4.92 

Male 

Fever 


CC-23/15 

2015/7/13 

2.33 

Female 

Fever, expectoration 


Severe pneumonia case was indicated by symbol "a" 


G were <0.7%, but the distances were >0.9% when com¬ 
pared with genotypes A and E (Fig. lb). These results 
suggested that the novel lineage had a closer evolutionary 
relationship with genotypes B, C, D, F, and G. Genotype D 
was the descendant of the recombination events between 
genotypes B and C. Genotypes G and F were both D-like 
genotypes, which showed similar recombination patterns 
in most parts of the sequence with genotype D strains, 
except for parts of the nsplO gene. The lowest whole- 
genome-sequence genetic distance between distinct gen¬ 
otypes (A-G) of HCoV-OC43 was 0.26 ± 0.02% (between 
genotypes F and D) in a previous study . According to 
these criteria, the mean distances of a novel lineage 
compared with the other seven identified genotypes ran¬ 
ged from 0.45 ± 0.02% to 0.99 ± 0.01%, which suggested 
that a novel genotype of HCoV-OC43 emerged. The novel 
genotype was designated as genotype H. To further ana¬ 
lyze the recombination structures of genotype H strains, 
neighbor-joining trees of 24 known genes and whole- 
genome sequences were constructed. Eighteen whole- 
genome sequences of HCoV-OC43 strains, which belon¬ 
ged to genotypes A-G, were used as reference strains 
(Supplementary Fig. S2). The seven genotype H strains 
showed a close relationship with the reference strains 
belonging to genotypes D, G (D-like), and F (D-like) in the 
phylogenetic trees of most nonstructural protein genes 
(nspl-nspl6), the NS2a gene, and the HE gene. However, 
the NS4, E, M, N, I and whole-genome sequences were 
clustered with the genotype B strains, and the S gene 
showed a close relationship with genotype B and E strains. 


Subsequently, we constructed a similarity plot and per¬ 
formed boot scanning analysis using full-length genome 
sequences. From the 5'-end of the genome to position 
23,000 nt, genotype H strains showed a greater similarity 
to genotype F (D-like) strains. From positions 23,000 nt to 
27,000 nt, genotype H strains were closely related to 
genotype E strains. From positions 27,000 nt to the 3'-end 
of the genome, genotype H was closely related to geno¬ 
type B. These findings were consistent with the phyloge¬ 
netic analysis of the 24 genes and suggested that the 
occurrence of natural recombination events resulted in 
the emergence of the novel genotype H of HCoV-OC43 
(Supplementary Fig. S3 A and S3B). The demographic and 
clinical profiles of children infected with HCoV-OC43 
genotype G or Hin the present study are summarized in 
Table 1. 

In summary, the present study reported a novel HCoV- 
OC43 recombinant genotype H, which was detected 
among children with CAP in mainland China. The novel 
genotype H might have been generated by recombination 
events among putative parental genotype D-like, genotype 
E, and genotype B strains. Our results emphasize the need 
for continuous surveillance of HCoV-OC43 in mainland 
China to better understand the mechanisms of the phylo 
dynamics of HCoV-OC43. 

Acknowledgements 

This study was approved by the Medical Ethics Committee of the Beijing 
Children's Hospital, Capital Medical University, and informed consent was 
obtained. We would like to acknowledge all the physicians and participants for 
collecting clinical specimens from the patients used in this study. We thank all 





Zhu et al. Emerging Microbes & Infections (2018)7:173 


Page 4 of 4 


the reviewers for comments that improved this manuscript. This work was 
supported by grants from the National Science and Technology Supported 
Projects (grant number: 2013BAI09B11), National Major Science & Technology 
Project for Control and Prevention of Major Infectious Diseases in China 
(2017ZX10103004-004), Beijing Talents Fund (grant number: 

2014000021469G239), Beijing Children's Plospital Young Investigator Program 
(grant number: BCFHYIPB-2016-12) and Basic and Clinical Research Cooperation 
Project of Capital Medical University (grant number: 17JL46). 

Author details 

1 Beijing Key Laboratory of Pediatric Respiratory Infection Diseases, Key 
Laboratory of Major Diseases in Children, Ministry of Education, National 
Clinical Research Center for Respiratory Diseases, National Key Discipline of 
Pediatrics (Capital Medical University), Beijing Pediatric Research Institute, 
Beijing Children's Plospital, Capital Medical University, National Center for 
Children's Health, Beijing, China. 2 The 2nd Affiliated Hospital and Yuying 
Children's Hospital of Wenzhou Medical University, Wenzhou, China. 3 Beijing 
Children's Hospital, Capital Medical University, National Center for Children's 
Health, Beijing, China. 4 The Children's Hospital-Zhejiang University School of 
Medicine, Hangzhou, China. Children's Hospital Capital Institute of Pediatrics, 
Beijing, China. 6 Shengjing Hospital of China Medical University, Shenyang, 
China. Children's Hospital of Chongqing Medical University, Chongqing, 
China. Che First Affiliated Hospital of Guangzhou Medical University, 
Guangzhou, China. Cuangzhou Women and Children's Medical Center, 
Guangzhou, China. 10 Xin Hua Hospital Affiliated to Shanghai Jiao Tong 
University School of Medicine, Shanghai, China. "Yinchuan Women and 
Children Healthcare Hospital, Yinchuan, China. 'Children's hospital of 
Changchun, Changchun, China 

Conflict of interest 

The authors declare that they have no conflict of interest. 


Publisher's note 

Springer Nature remains neutral with regard to jurisdictional claims in 
published maps and institutional affiliations. 

Supplementary Information accompanies this paper at (https://doi.org/ 
10.1038/S41426-018-01 71-5). 

Received: 21 March 2018 Accepted: 2 September 2018 
Published online: 30 October 2018 


References 

1. Masters, P. S. & Perlman, S. in Fields virology 6th edn, Vol. 1 (eds Knipe & 
Howley: Wolters Kluwer, Lippincott Williams & Wilkins) Ch. 28 (Philadelphia, 
2013). 

2. Morfopoulou, S. et al. Human coronavirus OC43 associated with fatal ence¬ 
phalitis. N. Engl. J. Med 375, 497-498 (2016). 

3. St-Jean, J. R. et al. Human respiratory coronavirus OC43: genetic stability and 
neuroinvasion. J. Virol. 78, 8824-8834 (2004). 

4. Coleman, C. M. & Frieman, M. B. Coronaviruses: important emerging human 
pathogens. J. Virol. 88 , 5209-5212 (2014). 

5. Woo, P. C. et al. Discovery of seven novel Mammalian and avian coronaviruses 
in the genus deltacoronavirus supports bat coronaviruses as the gene source 
of alphacoronavirus and betacoronavirus and avian coronaviruses as the gene 
source of gammacoronavirus and deltacoronavirus. J. Virol. 86 , 3995-4008 
( 2012 ). 

6. International Committee on Taxonomy of Viruses. Virus taxonomy: 2017 
release, https://talk.ictvonline.org/taxonomy/ (EC 49, Singapore, 2017). 

7. Su, S. et al. Epidemiology, genetic recombination, and pathogenesis of 
coronaviruses. Trends Microbiol. 24, 490-502 (2016). 

8. Zhang, Y. et al. Genotype shift in human coronavirus OC43 and emergence 
of a novel genotype by natural recombination. J. Infect. 70, 641-650 
(2014). 

9. Kin, N. et al. Genomic analysis of 15 human coronaviruses OC43 (HCoV- 
OC43s) circulating in France from 2001 to 2013 reveals a high intra¬ 
specific diversity with new recombinant genotypes. Viruses 7, 2358-2377 
(2015). 

10. Vijgen, L. et al. Complete genomic sequence of human coronavirus OC43: 
molecular clock analysis suggests a relatively recent zoonotic coronavirus 
transmission event. J. Virol. 79, 1595-1604 (2005). 

11. Vijgen, L. et al. Circulation of genetically distinct contemporary human 
coronavirus OC43 strains. Virology 337, 85-92 (2005). 

12. Lau, S. K. et al. Molecular epidemiology of human coronavirus OC43 
reveals evolution of different genotypes over time and recent emergence 
of a novel genotype due to natural recombination. J. Virol. 85, 11325-11337 
( 2011 ). 

13. Oong, X. Y. et al. Identification and evolutionary dynamics of two novel 
human coronavirus OC43 genotypes associated with acute respiratory 
infections: phylogenetic, spatiotemporal and transmission network analyses. 
Em erg. Microbe. 6 , e3 (2017). 

14. Tamura, K. et al. MEGA5: molecular evolutionary genetics analysis using 
maximum likelihood, evolutionary distance, and maximum parsimony 
methods. Mol. Biol. Evol. 28, 2731-2739 (2011). 


